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Quadratic response theory for spin-orbit coupling in semiconductor heterostructures 

Bradley A. ForemarQ 

Department of Physics, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China 

This paper examines the properties of the self-energy operator in lattice-matched semiconduc- 
tor heterostructures, focusing on nonanalytic behavior at small values of the crystal momentum, 
which gives rise to long-range Coulomb potentials. A nonlinear response theory is developed for 
nonlocal spin-dependent perturbing potentials. The ionic pseudopotential of the heterostructure is 
treated as a perturbation of a bulk reference crystal, and the self-energy is derived to second order 
in the perturbation. If spin-orbit coupling is neglected outside the atomic cores, the problem can 
be analyzed as if the perturbation were a local spin scalar, since the nonlocal spin-dependent part 
of the pseudopotential merely renormalizes the results obtained from a local perturbation. The 
spin-dependent terms in the self-energy therefore fall into two classes: short-range potentials that 
are analytic in momentum space, and long-range nonanalytic terms that arise from the screened 
Coulomb potential multiplied by a spin-dependent vertex function. For an insulator at zero tem- 
perature, it is shown that the electronic charge induced by a given perturbation is exactly linearly 
proportional to the charge of the perturbing potential. These results are used in a subsequent paper 
to develop a first-principles effective-mass theory with generalized Rashba spin-orbit coupling. 

PACS numbers: 73.21.-b, 73.61.Ey, 71.15.Ap 
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I. INTRODUCTION 

The Rashba Hamiltonian 1 . is the prototype of a 
class of effective-mass Hamiltonians describing spin- 
orbit coupling in semiconductors. 2 These models have 
been under intensive study in the past several years 
due to theoretical and experimental advances in spin- 
related phenomena such as the intrinsic spin Hall 
effect 3 ! 4 . 5 . 6 . 7 . 8 . 9 . 10 ! 11 ! 12 ! 13 ! 14 ! 15 ! 16 ! 17 and the spin galvanic 
and circular photogalvanic effectsi 18 i 19 ' 20 i 21 ' 22 i 23 ' 24 i 25 
Such effects are generated by spin-orbit coupling terms 
in the conduction or valence bands of clean nonmagnetic 
semiconductors. Although a variety of different two- and 
three-dimensional semiconductor systems are under in- 
vestigation, one of the most widely studied is a hetero- 
structure between semiconductors with the zinc-blende 
structure, in which an external electric field can be used 
to tune the relative contributions from the Rashba and 
Dresselhaus spin-splitting terms ,2^2^21 

In a two-dimensional effective-mass model, the Rashba 
coupling has no coordinate dependence. But in a 
three-dimensional theory, it is usually separated into 
(1) a contribution proportional to the macroscopic elec- 
tric field generated by gate voltages, dopants, and 
free carriers; and (2) S functions representing the 
contribution from the rapid change in potential at 
a heterojunction, 2 ! 28 ! 29 ! 30 ! 31 ! 32 ! 33 ! 34 ! 35 ! 36 ! 37 ! 38 ! 39 However, 
the assumption that a heterojunction can be represented 
by a short-range S potential has never been justified 
from first principles. In a self-consistent theory with 
electron-electron interactions, there are in general long- 
range Coulomb multipole potentials that are not well lo- 
calized at the interface. These long-range potentials con- 
tribute spin-dependent terms to the Hamiltonian. Thus, 
it is important to establish the conditions under which 
such terms will appear in the effective-mass Hamiltonian 
for a heterojunction. 



This paper examines the long-range terms in the 
self-energy of a quasiparticle for the case of a lattice- 
matched semiconductor heterostructure. The potential 
energy of the ions is described using norm-conserving 
pseudopotentialsi^i^ii 4 ^ The heterostructure pseudopo- 
tential is treated as a perturbation of a bulk refer- 
ence crystal, with the self-energy calculated to sec- 
ond order in the perturbation using quadratic response 
theory i 43 i 44 i 45 i 46 i 4 7i 48 This approach is well justified nu- 
merically, since the linear response alone has been shown 
to give excellent predictions for the valence band offset 
in a variety of material systems, including isovalent and 
heterovalent heterostructuresi 4 ?i 50 ' 51 i 52 i 53 i 54 ' 55 i 56 ' 57 

The approach used here follows closely earlier work 
by Sham 5 ^ on the theory of shallow impurity states in 
bulk semiconductors. Sham's work is generalized to in- 
clude nonlocal spin-dependent potentials and terms of 
higher order in the crystal momentum. Even for local 
spin-independent potentials, the present work includes 
terms neglected in Sham's analysis, such as dipole po- 
tentials in the quadratic response. 

This paper is limited to a study of the electron self- 
energy in the limit of small crystal momentum. The 
derivation of an effective-mass Hamiltonian from these 
results is presented in the following paper 

As a workable approximation scheme, the calculation 
of the linear response is carried out to terms two orders 
in q higher than the lowest nonvanishing term, where 
q is the crystal momentum transfer of the perturbing 
potential. The quadratic response is calculated to the 
same order in q as the lowest nonvanishing term in the 
linear response. (See the following papes^ 9 - for further 
discussion of this approximation scheme.) Three classes 
of heterostructure perturbations are considered: 

(I) Heterovalent perturbations with nonzero charge. In 
this case the perturbing potential includes a monopolc of 
0(q~ 2 ), and the analysis is performed to an accuracy of 
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O(q ) in the linear response and 0(q 2 ) in the quadratic 
response. 

(II) Isovalent perturbations for which the linear re- 
sponse has a nonzero dipole moment. In this case the 
leading term in the linear response is 0(q^ 1 ), so the lin- 
ear response is evaluated to 0(q) and the quadratic re- 
sponse is evaluated to 0(q^ 1 ). 

(III) Isovalent perturbations for which the linear re- 
sponse has no dipole moment. In this case the leading 
term in the linear response is 0(q°), so the linear re- 
sponse is evaluated to 0(q 2 ) and the quadratic response 
is evaluated to O(q ). 

The perturbations that make up a given heterostruc- 
ture are generally a mixture of classes I, II, and III. The 
simplest situation is that of an isovalent heterostructure 
made up of semiconductors with the zinc-blende struc- 
ture, such as GaAs/AlAs or InAs/GaSb. In this case, 
every ionic perturbation is an isovalent perturbation from 
class III. 

Most theoretical and experimental studies of the 
Rashba spin-splitting Hamiltonian have dealt with this 
type of heterostructure. This case is therefore studied in 
greatest detail here, by working out the explicit form of 
the self-energy from crystal symmetry. The results show 
that in this case the Rashba Hamiltonian contains only 
short-range terms (to within the accuracy of the stated 
approximation scheme). However, there are long-range 
spin-dependent terms that are not of the Rashba form. 

In a heterovalent system such as Ge/GaAs, the ionic 
perturbations are from class I. However, since macro- 
scopic accumulations of charge are energetically unfa- 
vorable, real heterostructures tend to be macroscopically 
neutrali£i*Si Such a nominally heterovalent class I prob- 
lem can therefore often be reduced to an isovalent class 
II or III problem by grouping the ions together in neu- 
tral clusters and treating these clusters as the basic unit. 
This approach is discussed further in Appendix lAl 

In wurtzite heterostructures such as GaN/AIN, the 
ionic perturbations are from class II, since the site sym- 
metry of atoms in the wurtzite structure (space group 
Cq v ) permits a dipole moment. These dipole terms pro- 
duce spontaneous polarization along the hexagonal c axis 
in bulk wurtzite crystals, leading to macroscopic inter- 
face charge at heterojunctions. 61 i 62 i 63 i 64 i 65 i 66 i 67 i 68 Such 
charge produces macroscopic electric fields that generate 
different piezoelectric strain fields in different materials. 
The present theory, which is restricted to lattice-matched 
heterostructures, is therefore not generally applicable to 
wurtzite systems (except in the unrealistic special case^i 
where the interface polarization charge is exactly can- 
celled by an external interface charge). However, the 
results derived here provide a first step towards a more 
general theory dealing with lattice-mismatched hetero- 
structures. 

The paper begins in Sec. [H] by establishing the basic 
definitions and notation for the Green function and self- 
energy that are used throughout the paper. The finite- 
temperature formalism is used (both for generality and 



because it facilitates the derivation of Ward identities), 
although the main interest of this paper is the limit of 
an insulator at zero temperature. In Sec. IIIII the self- 
energy is expanded in powers of the perturbing potential 
using vertex functions. A set of Ward identities is derived 
for the vertex functions at finite and zero temperature. 
Section llVl presents general expressions for the nonlinear 
density response, including Ward identities for the static 
polarization. The bare ionic perturbations are screened 
in Sec. E] where the proper vertex functions and proper 
polarization are introduced. 

A detailed analysis of the small wave vector proper- 
ties of the linear screened potential is carried out in Sec. 
EJfor the special case of a local spin-independent per- 
turbation. The quadratic response for the same case is 
considered in Sec. I VI il and the linear and quadratic con- 
tributions to the self-energy are derived in Sec. IVIIII In 
Sec. IIXI it is shown that in the norm-conserving pseudo- 
potential formalism, the contributions from the nonlocal 
spin-dependent part of the perturbing potential merely 
renormalize the contributions from the local part of the 
perturbation. The main results of the paper are discussed 
and summarized in Sec. Ixl 



II. GREEN FUNCTION AND SELF-ENERGY 

This section establishes the notation, basic definitions, 
and symmetry properties of the Green function and self- 
energy used in subsequent sections of the paper. The 
starting point is the definition of the one-particle thermal 
Green function 69 i 70 i 71 

G ss ,(x,t;x',t') = -(T T [v3 s (x,r)^,(x',r')]), (2.1) 

where s = ±A labels the z component of the spin, 
t is the imaginary time, T T is the time ordering op- 
erator, and V» s (x, r) = e Kr ip s (x)e~ KT and $J(x,t) = 
e KT tpl(x.)e~ Kr are field operators in the Heisenberg pic- 
ture. The angular brackets denote a thermal average 

(6) =e pn Tr(e- f} *d), (2.2) 

where (3 = 1/ksT is the inverse temperature, Tr denotes 
a trace over the many-particle Fock space, K = H — fj,N 
is the grand Hamiltonian (with /i the chemical potential 

and N the number operator), and e~" = Tr(e~P K ). 
The many-particle Hamiltonian is defined by 

H = Y,ff #( X )>W ( x > x ')Vv {*')d 3 x d 3 x' 

s.s ; 

+ 1 E ffm^^^M^ (2 . 3 ) 

2 , J J x x 

where h = Jv is the Hamiltonian of a single noninteract- 
ing particle, and Hartree atomic units are used. 
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Since K is time independent, G has the form 
G ss /(x,t;x',t') = G ss >(x,x',t-t'), with G ss , (x, x' , t - 
f3) = -G ss '(x, x',r) for < r < /3. This permits the 
Fourier series representation (for —j3 < t < (3) 



G ( r ) = « E G (C«K 

n— — oo 

G(Cn) = / G(r)e<" T dT., 
Jo 



(2.4a) 



(2.4b) 



where Cn = (2ji + l)7r//3, and G(r) denotes a single- 
particle operator whose matrix elements in the |x, s) ba- 
sis are G ss i (x, x', r). A continuous Green function G(w) 
may then be defined by analytic continuation of G(£„) 
from the discrete frequencies to = /i + «Cn- 

The Green function G{u>) satisfies Dyson's equation 

[w-h- E(w)]G(w) = G(lo)[lo -h- E(w)] = 1, (2.5) 

which is an implicit definition for the self-energy operator 



EH 



(2.6) 



A formal solution to Eq. (|2.5I) can be constructed by solv- 
ing the nonhermitian eigenvalue equations 

[h + E(w)]|^ n (w)) = £ n (u;)|V>„(w)>, (2.7a) 
[/^ + Et( w )]|x„(w)) = E*(u)\Xn(u)), (2.7b) 

which are also referred to as Dyson's equations. It is usu- 
ally assumed that the solutions to (|2.7|l form a complete 
biorthonormalZ 2 ^ set with the properties 



(xnMIVvM) = <W) 

£|V>nM><XnMI=l> 



(2.8a) 
(2.8b) 



although this is difficult to prove in general. 73 If Eqs. 
(|2.8|) are valid, then G(lo) is given by^Zi 



(2.9) 



which satisfies the Dyson equation (|2.5|l by construction. 

Symmetries of the many-particle Hamiltonian H under 
time reversal and space group operations imply corre- 
sponding symmetries of the one-particle operators G and 
E. Derivations of the most useful symmetry relations are 
presented in Appendix IBI 



III. VERTEX FUNCTIONS 

In this section, a perturbative approach to the Dyson 
equation l|2.7a|) is developed by using vertex functions 
to expand the self-energy in powers of the perturbing 
potential. 



A. Definitions 

The single-particle Hamiltonian h is chosen here to 
have the form 

/W(x,x') = -iV 2 i(x-x')4^+^(x,x'), (3.1) 

where the fixed external potential w ext is a norm- 
conserving ionic pseudopotential^ "^^ 2 " which accounts 
for both spin-orbit coupling 75,76,77 - 78 and scalar relativis- 
tic effects. The Dyson equation (|2.7a|l is therefore 



f 



V 2 Vs(x,w) 



E 



V M /(x,x / ,w)Vv(x',a;)dV 

= E(u)ip a (x,u), (3.2) 



in which ip is a spinor wave function, and the total po- 
tential energy V is 

y ss ,(x,x',w)=<^(x,x / )+E ss ,(x,x / ,o;). (3.3) 

The external pseudopotential can be separated as 
v cxt _ v (o) _j_ v ^ wnere v (o) j s |- ne potential of some peri- 
odic reference crystal, and v is a nonperiodic perturba- 
tion associated with a heterostructure or an impurity. It 
is assumed that the total potential (|3.3|) can be repre- 
sented as a power series in the perturbation v: 



V SlS2 (x 1 ,r 1 ;x 2 ,r 2 ) = V(12) = V<">(12), (3.4) 

u=0 



where is the potential 1)3. 3|l when v cxt — v^°\ and 
the numerical arguments on the right-hand side are short- 
hand for the space, spin, and time coordinates (1) = 
(xi,si,ti). Although the upper limit of the formal ex- 
pansion H3.4|) is written as v = oo, this may well be an 
asymptotic series, and in practice only a finite number of 
terms are retained. 

The linear and quadratic terms of l|3.4|l are 



V~ (1) (12) = r (1) (1243)u(34), 
7( 2 )(12) = ir^(124365M34M56), 



(3.5) 



where is called the vertex function of order v. Here 
a summation or integration of repeated coordinates is 
assumed, and the labels are ordered as the trace of a 
matrix product. The perturbation v is taken to be an 
instantaneous static potential of the form 



v(34) = w S3S4 (x 3 ,X4)i5(t 3 - r 4 ). 



(3.6) 



The vertex functions are by definition functional deriva- 
tives of V with respect to v; 



T (2) (124365) = 



S 2 V(12) 
Sv(34)8v(56) 
<5rW(1243) 

<5i>(56) ' 



(3.7) 
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which may also be expressed as 



IV. NONLINEAR DENSITY RESPONSE 



I'M (1243) = 5(13)5(24) + 
r(2) ( 124365 )=5»6)' 



(3.8) 



in which 5(12) = 5 slS2 5(xi — X2)5(ti — ra). Note that 
upon application of the Fourier transforms defined in 
Appendix [21 the above equations hold equally well in 
momentum and frequency space. 

It is convenient at this point to carry out the time inte- 
grals in Eq. (|3.5() . This eliminates the variables Ts,T4, . . . 
from I^") and v, and reduces the time dependence of the 
5(13)5(24) term in Eq. (|3~%|) to 5(n - r 2 ). It is assumed 
below that this has been done. 



B. Ward identities 

The vertex functions satisfy various Ward identi- 
tiesZ2i22i2i for certain limiting values of their arguments. 
One set of these can be derived by varying the chemical 
potential p by a small amount 5/i. Since K = H — pN, 
this is equivalent to varying v by 5u(12) = —5/i 5(12). 
For this special case, Eqs. (|3.7|) and (|3.8|) reduce to 



r( 1 )(1233) = 5(12)-^M. 

5/i 

5TW (1243) 



(3-9) 



T (2) (124355) 



5/i 



which involve a trace over the input variables of T. If £ 
is analytically continued as a function of u> = p + i£ n , the 
variation with respect to p may be written as 



_5_ 

5/i 



d_ 



d_ 
dp' 



(3.10) 



Now for the special case of an insulator at T = 0, 
the chemical potential can have any value in the range 
/i^Y— i < /i < Pn, where pn is the minimum energy 
needed to add one particle to the ground state of an TV- 
particle system, and the energy gap is E g = /ijv — Pn-i- 
[For small temperatures p approaches the well-defined 
limit \(pn + HN-i)r^ but at exactly T = it becomes 
ill defined.] Since p can vary arbitrarily within the gap 
for a system with finite E g , the Ward identities for the 
insulator reduce to 



T (1) (1233) = 5(12) 



9S(12) 
dui 



r9-w x <9rW(1243) /0 11 , 

r< 2 > 124355 = ± '-, (3.11) 

OLO 

r( 2 )(123344) = ^^, 

which generalize and extend the results derived for spin- 
independent local potentials in Refs. l58l and l80l 



A. Definitions 

In this section, perturbation theory (see Appendix |D|) 
is used to evaluate the electron density of the system with 
Hamiltonian H = Hq + Hi, in which Hq is the Hamilto- 
nian of the reference crystal and Hi is the perturbation 
due to v: 

H x = tr(pw) = p s/s (x',x)v ss ,(x,x')d 3 xd 3 x', 



where p is the density operator 

p s > s (x',x) = tpl(x)ip s/ (x'). 



(4.1) 



(4.2) 



The mean nonlocal electron density in the perturbed sys- 
tem is defined as 



<(x,x') = (p s y(x,x',r)), 



(4.3) 



which is independent of r. If n is evaluated using the 
perturbation theory formula (|D2fl . one obtains a power 
series in v. 



,(x,xO=X>M(x,x'), 



in which 



,((») 



;(x,x') = (p ss /(x,x',r))o 



(4.4) 



(4.5) 



is the density of the reference crystal. (The notation (O)q 
refers to a thermal average with respect to the reference 
crystal; see Appendix iDl) The terms of order v > are 
given by 

n M(00') = InM (00', l'l, . . . , v'v)v(\V) ■ ■ ■ vivv 1 ), 

(4.6) 

where IT^' is the i/ th -order static polarization (or den- 
sity correlation function), which is defined in Eq. (|E1|) . 
Here and below, the numerical arguments of rS v \ I^ u \ 
and v are time-independent quantities of the form (0) = 
(x , s ). 



B. Ward identities 

The linear and quadratic density response are given 
explicitly by 

rjW(i2) = II«(1243>(34), 

n( 2 ) (12) = in (2) (124365M34)t;(56), 
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which shows that II may be defined as a functional deriva- 
tive of n with respect to v: 



n (1) (1243) 
n (2) (124365) 



(4.8) 



5n(12) 
Sv(M) ' 

5 2 n(12) 
5v(34)5u(56) 
5r±W(1243) 

5u(56) 



For the special case Sv(12) = —8 fid (12) representing a 
variation in chemical potential of Sfi (see Sec. lIII Bf . these 
expressions give 



n (1) (1233) 
n (2) (124355) 



dn(12) 
dfi 

an( 1 )(i243) 

dfi 



(4.9) 



which are the Ward identities for the static polarization. 
For an insulator at T = 0, fi is indefinite, and these 
reduce to 



n (1) (1233) = 0, 11^(124355) = 



(2)/ 



(4.10) 



V. SCREENING 



A. Potential 



In this section, the concept of screening is used to 
extract the long-range Coulomb interaction terms from 
the polarization and vertex functions. The first-order 
screened potential tp is defined by adding the Coulomb 
potential generated by tv- 1 ' to v. 

</j(12) = v(12) + u(1243)n (1) (3465)u(56). (5.1) 

Here u represents the Coulomb interaction, which is spin- 
independent and local in coordinate space at both the 
input and output: 

u(1243) = <5 SlS2 <5 S3S4 <5(xi - x 2 )5(x 3 - x 4 )u(xi - x 3 ). 

(5.2) 

In momentum space (see Appendix [UJ) this has the form 



it(1243) = 5 SlS2 5 S 3 S4 5k 1 -k 2 ,k 3 -k 4 w(ki - k 2 ), (5.3) 

in which u(k) ~ v c (k)/f2, where f2 is the crystal volume 
and 



v c (k) 



4ir/k 2 if k ^ 0, 



(5.4) 



if k = 0. 

Another way of writing the screened potential is 

tp(12) = £- 1 (1243)u(34), (5.5) 



in which the inverse static electronic dielectric matrix is 
e -1 (1243) = 5(13)5(24) + w(1265)n (1) (5643). (5.6) 
The dielectric matrix e satisfies 

e(1243)e -1 (3465) = e" 1 (1243)e(3465) = 5(15)5(26), 

and is given explicitly below in Eq. I|5.17f) . For an insu- 
lator, the Ward identity l|4.10|l yields 

e -1 (1233) = 5(12). (5.8) 

The second-order potential (p^ is just the Coulomb po- 
tential generated by n^: 

<^ (2) (12) = w(1243)n (2) (34). (5.9) 

By translation symmetry, Il( 1 )(1243) = unless ki — 
k 2 = k 3 — k 4 + G, where G is a reciprocal lattice vector 
of the reference crystal. Equation 1)5.1(1 may therefore be 
written as 

¥w(k,k') = v ss >(k,k') + 8 ss ,v c (q)^2^2 

k" G 

x n V A(q; k", k" + q + G)v xx >(k" + q + G, k"), 

(5.10) 

where q = k — k' and 

ilv A (q; k,k + q+G) 

= ^ E n ia,A'A(q + k 2 , k 2 ; k, k + q + G). (5.11) 
k 2 

This simplified form of II is introduced because the 
Coulomb potential depends only on the local spin- 
independent density n(x) = n QQ (x, x). 

B. Vertex functions 

Given Eqs. I|5.5|l and (|5.9|l . one can rewrite the total 
potentials l|3.5|l as 



V {1) (12) = f (1) (1243M34), 
))ip(i 

+ f (1) (1243)^ (2) (34), (5.12) 



V {2) (12) = if (2) (124365)^(34)<^(56) 



in which the proper vertex function f is defined in 
perturbation theory as the sum of all i/ th -order vertex di- 
agrams that cannot be separated into two disconnected 
parts by cutting one Coulomb interaction line or one elec- 
tron propagator. (An alternative definition would be as 
the V th functional derivative of V with respect to <£>^1SA) 
From the above results, V and T are related by 

T (1) (1243) = f (1) (1265)e~ 1 (5643), 
T (2) (124365) = f (2) (128709)e- 1 (7843)e- 1 (9065) 

+ f (1) (1287)w(8709)n (2) (904365). (5.13) 
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In an insulator, the Ward identities 1(4.10(1 and (|5.8|l yield 

T (1) (1233) = f (1) (1244), 
T (2) (124355) = f (2) (128799)e~ 1 (7843), (5.14) 
T (2) (123344) = f (2) (125566). 

Hence, for insulators, the Ward identities ((3.11(1 are valid 
for both T and f. 

C. Polarization 

In a similar fashion, one can define the proper polar- 
ization II as the sum of all static polarization diagrams 
that cannot be split by cutting a Coulomb line. Thus 

II (1) (1243) = IT (1) (1265)e" 1 (5643), (5.15) 

which has the form of a Dyson equation^ 

IT (1) (1243) = ft (1) (1243) + n (1) (1265)w(5687)II (1) (7843) 

= ft (1) (1243) + II (1) (1265)w(5687)n (1) (7843). 

(5.16) 

This can be used to verify that the dielectric matrix 

e(1243) = 5(13)5(24) - u(1265)ft (1) (5643) (5.17) 

is indeed the inverse of Eq. 1)5.6(1 . Equations 1(5.15(1 and 
(15.16(1 can also be written as 

nW(1243) = e- 1 (6512)n( 1 )(5643), (5.18) 

in which the symmetry property ((E4() was used. The total 
and proper quadratic polarizations are likewise related by 
(see Fig. 5 of Ref. HJ 

±I (2) (124365) = e" 1 (8712)n (2) (78092'l')e~ 1 (9043) 

x e - 1 (l'2'65). (5.19) 

In an insulator, Eqs. (O and (|5~l"5|) give ftW(1233) = 
nW(1244) = 0, which impli es tha t e(1233) = 6(12). The 
inverses of Eqs. ((5.15(1 and ((5.19(1 . i.e., 

fl (1) (1243) = n (1) (1265)e(5643), (5.20) 

ft (2) (124365) = e(8712)n (2) (78092T)e(9043)e(l'2'65), 

(5.21) 

then show that the insulator Ward identities ((4.1(J|) are 
valid for both II and II. 

Since the Coulomb interaction depends only on the 
reduced polarization matrix ((5.11|) , Eq. 1(5.16(1 can be re- 
duced to 

IW (q + G; k, k + q + G') = U ss ' (q + G; k, k + q + G') 

+ n(q + G, q + G> c (q + G") 

G" 

x ILv(q + G";k,k + q+G'), (5.22) 



in which a scalar version of II is defined by 

IT(q,q+G) = ^n AA (q;k,k + q + G). (5.23) 
k 

Now v c (q + G") is nonsingular in the limit q — > when 
G" 7^ 0, so it is convenient to regroup the series expan- 
sion of Eq. 1(5. 22|) so as to isolate the terms w c (q); 85,86 

Tl ss , (q + G; k, k + q + G') - P sa . (q + G; k, k + q + G') 
+ P(q + G, qH(q)II sa , (q; k, k + q + G'). (5.24) 

Here P is the sum of all polarization diagrams that can- 
not be separated by cutting a Coulomb line labeled with 
q (although they may be split by cutting lines labeled 
q + G" with G" ^ 0). This will be called the regular 
polarization; it is related to the proper polarization n by 
Eq. i(5~22jl with n -> P and G" ^ 0. 

Both ft and P are well behaved in the limit q — > 0, but 
P is more convenient for analysis of the small-g behavior 
of n because, unlike the case for II, it does not require 
the inversion of matrices (see Ref. |83 for further discus- 
sion and an alternative approach) . From the relationship 
between P and S, it is apparent that the insulator Ward 
identities (|4. 1U|) for n and ft hold for P as well. 



VI. LINEAR RESPONSE TO A LOCAL 
PERTURBATION 

In this section the properties of the screened potential 
ip are examined in greater detail for the case of a local 
perturbing potential^ 8 , which by definition has the form 

w(x,x') = <5(x-x>(x), 

u(k,k') = «(k-k'). 1 ' ' 

Here the spin indices were omitted because a hermitian, 
time-reversal invariant, local potential must be a spin 
scalar (see Sec. IIXII . Contributions from the nonlocal 
part of the perturbation are considered in Sec. IIXI 



A. Screened potential 

With this simplification, all of the polarization matri- 
ces can be reduced to the scalar form ((5. 2311 . and the 
screened potential 1(5.10(1 simplifies toS 

v(q) = «(q) + «c(q) Y, n ( q ' q + G ) w ( q + G )- ( 6 - 2 ) 

G 

Likewise, the local version of Eq. 1(5.2411 is 

n(q + G, q + G') = P(q + G, q + G') 

+ P(q + G,qK(q)n(q,q + G'). (6.3) 
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It is convenient at this point to define a macroscopic 
static electronic dielectric functio: 



,88 



e(k) = l-v c (k)P(k,k) 

= i/[i + « c (k)n(k,k)], 



(6.4) 



which may be used to express II as a function of the 
regular polarization P: 

n(q,q + G) = e- 1 (q)P(q,q + G), (6.5a) 

n(q + G,q) =e- 1 (q)P(q + G,q), (6.5b) 

n(q + G, q + G') = P(q + G, q + G') + P(q + G, q) 

xe- 1 (q) t ; c (q)P(q ! q + G'). (6.5c) 

Here all of the nonanalytic behavior at small q is con- 
tained in the factors V c (q) and e _1 (q). 

Upon substituting (|6.5(l into l|6.2|l . one obtains the 
screened potentials 

v(q) . u c (q) 



y(q) 



e(q) e(q) 



^P(q,q+GHq + G), (6.6) 



G^O 



p(q + G) = v(q + G) + » c (q + G)P(q + G, q)cp(q) 
+ v c (q + G) ^ P(q + G, q + G')v(q + G'), (6.7) 

G'^O 

where both expressions are valid for arbitrary q and G, 
but the latter is more useful for investigating the be- 
havior of Lp in the neighborhood of a nonzero reciprocal 
lattice vector. For small q, the first term in (|6.6|l is the 
macroscopic screening that occurs even for slowly varying 
potentials [with u(k) = for k outside the first Brillouin 
zone] , while the second term is a local- field corrections^ 
arising from the microscopic inhomogeneity of the refer- 
ence crystal. 



B. Power series expansions 

The next step is to establish the small-g proper- 
ties of P. Since the only singular Coulomb terms in 
P(q+G,q+G') are the factors w c (q+G") with G" ^ 0, 
the regular polarization P(q + G, q + G') is analytic for 
q < G m i n , where G m i n is the magnitude of the smallest 
nonzero reciprocal lattice vector. In addition, the sym- 
metry property (|E4|) gives P(x, x') = P(x',x) or 

P(k,k')=P(-k',-k), (6.8) 

while the Ward identity (|4.1(J|) for an insulator implies 
that 



lim P(q, q + G) = 0. 

q-+0 



(6.9) 



From these results, we see that the matrix PGG'(q) = 
P(q + G, q + G') has the Taylor series expansion 



P>o(q) = P$ + Pio 1 + ^oo 1 + 0(q»), 



[4] 



,[6] 



(6.10a) 



P 0G (q) = P^ + Pj& + P®. + + 0(q 5 ), (6.10b) 
P GG <(q) = P [ ° ] G , + Pg ] G , + Pgh + 0{q% (6.10c) 



with Pco(q) = Po,-g(— q)- Here P GG , denotes a general 
polynomial of order I in the Cartesian components of q; 
for example, 



4o 1 =4o 1 (q)=?a^O C 3 , 



(6.11) 



in which Pqq is a constant, and a sum over the Cartesian 
components a and j3 is implicit. 

In the limit of small q, the dielectric function of an 
insulator therefore tends toward a finite but direction- 
dependent limit: 

lim e(q) = e(q) = 1 - inP^^, (6.12) 

q^0 q z 

in which q = q/q. This behavior contributes nonanalytic 
terms in the small-g expansion of e (q): 

^ = [1 + ^(q)(P^ ] + 4o ] ) + ^ 2 (q)(4o ] ) 2 ] 

+ 0(q 6 ), (6.13) 



in which w c (q) = v c (q)/e(q). 



C. Pseudopotential 

To proceed further it is necessary to make some as- 
sumptions about the perturbing pseudopotential u(k). 
This is taken to be a superposition of spherically sym- 
metric ionic perturbatio ns, each o f which has the Gaus- 
sian form used in Refs. I40l4ll42l The pseudopotential 
can therefore be written as 



w(k) = w an (k) +v c (k)p(k), 



(6.14) 



in which v an (k) and /c(k) are entire analytic functions 
of k, and p(k) represents a portion of the pseudocharge 
density of the perturbation [the other portion being given 
by fcVn(k)/4^]. 

For a single ion, these functions have the form of a 
Gaussian times a polynomial in fc 2 ^i^ 



P(k) 



^[l + i(fcr ) 2 ] e -(^> 2 / 2 , 



« an (k) = ±{2nZ v rt + . 9 (fc 2 r 2 )] e ^> 2 / 2 , 



(6.15) 



where Z v is the charge of the ion, r is a core radius 
parameter, and g(x) is a cubic polynomial given in Refs. 
SHandl42l Here p(k) has been defined in such a way that 
its Taylor series contains no term proportional to k 2 : 



p(k) = p Q + p 4 k 4 + p 6 k 6 + 



(6.16) 



Therefore, the only term in t> c (k)p(k) that does not van- 
ish in the limit k — > is the divergent term — AnZy/flk 2 . 
This term has been eliminated^ at k = [by the def- 
inition (|5.4|l of f c (k)] because the ion is assumed to be 
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accompanied by Z v electrons, so that the crystal remains 
neutral after the perturbation. 

If the perturbation contains more than one ion (e.g., 
the quasiatoms defined in Appendix |A"|) . p(k) is just a 
general Taylor series 



p(k) = p + P W + + 



(6.17) 



although the linear response can always be treated as a 
superposition of individual ions. 

For G / 0, i?(q + G) is analytic for q < G, with the 
Taylor series 

«(q + G) = v§ + «W + ug 1 + + (9( 9 4 ), (6.18) 

in which = f(G). A similar expansion is valid for 
u c (q + G). 



D. Effective macroscopic density 

It is now convenient to rewrite Eq. I|6.6|l in a form 
modeled after the familiar expressions for screening in a 
homogeneous system^ 



y(q) 



.(q) 



v c (q)n(q) 
e(q) : 



(6.19) 



in which 



»(<0 = p(q) + ^oo(q)^an(q) + X, P 0G (q)^(q + g) 

G#0 

(6.20) 

is an effective macroscopic electron density, which is 
analytic for q < G m i n . This includes the (partial) 
bare charge p(q), the macroscopic charge induced by 
i>an(q), and the local-field corrections. Note that since 
<p(q) = v(q) + v c (q)n^(q), one can also write n(q) = 
[p(q) + rl ' 1 Hq)] e (q)- Also n °t e that n = po, because 
the second term in l|6.2U[) is 0(q 2 ) and the last term is 
0(q). The leading contributions to <p(q) for small q are 
therefore 



p(q) = PoK(q) + W 2 M{P$ + ^ 



,[6]> 
00 / 



?(q)(4 4 ] ) 2 ] 



[2] , ^[3] 



+ W 2 (q)P^(n^ + #1) + v an (q) + 0(q 3 ), (6.21) 

in which v an (q) is to be replaced by its Taylor series 
expansion. 

The first set of terms in (|6.21|) is proportional to 
po = —Z v /tt. These terms are just the power series 
expansion for po« c (q)/e(q). Such terms are present in 
general, but they vanish for isovalent perturbations (e.g., 
Al substituting for Ga in GaAs). 

The remaining nonanalytic terms depend on nfi , where 
I > 1. The symmetry of may differ from that of p^. 
For example, the term w c (q)n^(q) would contribute a 
dipole field if it were present, but pW vanishes for a spher- 
ically symmetric atom. However, since the symmetry of 



P is the same as that of the reference crystal, the sym- 
metry of n is just the site symmetry at the position of 
the ionic perturbation (i.e., the maximal common sub- 
group of the reference crystal space group and the full 
rotation group at the given atomic site). Therefore, for 
crystals of sufficiently low symmetry (e.g., wurtzite), n.M 
may contribute a dipole field even though pi 1 ! does not. 



E. Special cases 

The general expression (I6.21|) is quite cumbersome and 
is unlikely to be used in its entirety for any particular ma- 
terial system. In many cases one would only be interested 
in retaining terms that are two orders in q higher than the 
lowest nonvanishing term. Thus, for heterovalent pertur- 
bations with po 7^ (i.e., class I of the Introduction), Eq. 
(|6.21l) could be simplified to 

¥>(q) = ««o(0) + w c (q)(po + n [1] + n [2] ) 

+ p w 2 MpW+O(q), (6.22) 

which contains monopole, dipole, and quadrupole terms, 
plus a correction to the monopole term describing the 
wave vector dependence of the dielectric function. For 
isovalent perturbations in crystals with atomic site sym- 
metry that supports a dipole moment (class II) , a suitable 
approximation would be 

¥>(q) = Uan(q) + w c (q)(n [11 + n [2] + n [3] ) 

+ w 2 M)P [ o^+0(q 2 ), (6.23) 

which includes additional octopole terms. Finally, for 
isovalent perturbations in crystals with site symmetry 
that does not support a dipole moment (class III), one 
would use 

V(q) = Wan(q) + w c (q){n [2] + n [3] + n [4] ) 

+ w 2 c (q)P^+0(q 3 ). (6.24) 

As an explicit example, consider the case of isovalent 
substitutions in a crystal with the zinc-blende or diamond 
structure (space group T| or 7 h ), both of which have 
site symmetry Td at the atomic sites. In this case po = 
0, and the only invariants of order q A or lower are 1, 
q 2 , q x q y q z , q 4 , and q 4 + q 4 + q 4 z . Hence, the quadratic 

terms are isotropic {Pqq = P2^a/3, — n2q 2 ) and the 
long-wavelength dielectric function reduces to a constant 
[e(q) = e = 1 — AirP-z]. The leading contributions to <p(q) 
may be written as 

I \ 47 ™2,_, x > , n , „ 2 , n q x q y qz 

¥>(q) = (i - <5 q o) + Ci + C 2 q + c* 3 — §— 



a 



4 i 4 

Qx + Qv 



0(q 3 ), (6.25) 



where Ci is a constant. The terms C3 and C4 represent 
octopole and hexadecapole moments, respectively. 
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F. Nonzero reciprocal lattice vectors 



It also satisfies the Ward identity (|4.10|) : 



Turning now to <p{q+ G), Eq. (|6.7(l can be written in 
the condensed notation 

<p(q+G) = i? G o(qMq)+£ G (q), (6.26) 

in which i?Go(q) and £g(<i) are analytic for q < G m i n : 

^ (q) = {i c(q + G)P G0 ( q ) IfG^o! ^ 



e G (q) = (l-«Go) 



u(q+G)+w c (q + G) 
: Yl ^GG'(q)»(q + G') 

G'^0 



(6.28) 



For G ^ 0, Eqs. (|6.10b|) and pH£)l show that i? G o(q) is 
of order q or higher. The explicit form of the Taylor series 
for i?Go(q) is determined by finding the invariants of the 
group of the wave vector G in the reference crystal, where 
different G vectors are treated as inequivalent. Thus, for 
general G, the linear term is nonvanishing. However, the 
leading term in the Taylor series for £ g (q) (with G ^ 0) 
is a constant. 

Hence, for G ^ 0, the nonanalytic terms in <p(q + G) 
are at least one order in q higher than those in (p(q). In 
the zinc-blende example discussed above, one has 

V (q + G) = C 3 i? G0 ^f^ + XG(q) + 0(9 3 ), (6.29) 

in which XG(q) is analytic, and R G0 is the linear coef- 
ficient in the Taylor series for i?Go(q)- The nonanalytic 
term in i|6.29|) is a hexadecapole moment that is invariant 
with respect to the group of G. 



VII. QUADRATIC RESPONSE TO A LOCAL 
PERTURBATION 

To calculate the quadratic density m 2 )(k), it is help- 
ful to begin by considering the following partial density 
obtained from the local version of Eqs. (|4.7|l . I|5.5|) . and 
(CT1) : 



( k ) = ^E' E n^fckx + Gi.k-ki + Ga) 



2^ ^ 

ki GiG 



x ^(ki + GxMk-ki + Ga), (7.1) 

where the summation on ki is limited to the first Bril- 
louin zone of the reference crystal. Here the proper po- 
larization IT 2 ) (k, ki,k2) vanishes unless k = ki+k2 + G, 
where G is any reciprocal lattice vector. It satisfies the 
symmetry relations (IE4|) . the reduced form of which is 



n( 2 )(k,ki,k 2 ) =n( 2 )(k,k 2) k 1 ) = n( 2 )(-k l5 -k,k 2 ). 

(7.2) 



limn( 2 >(k,ki,k-ki +G 2 ) =0. (7.3) 

With these constraints, the Taylor series expansion of 
the polarization matrix n GG Ga (k, ki, k 2 ) = n( 2 )(k + 
G, ki -(- Gi, k 2 + G 2 ) has a form similar to that given for 
P in Eq. jOty : 



^(k, ki,k 2 ) = k a k ll3 k 2l U^ + 0(fc 4 ), 



n^ (k,k 1 ,k 2 ) = k 1/3 k 2y ii% 00 + o(k 



0i 



n(, 2 GiG2 (k ) k 1 ,k 2 ) = fc C( n« GiG2 + o(fc 2 ) ) 



(7.4) 



rigk lGa (k, k x , k 2 ) = n GGl G 2 + o(k), 



where the order of the leading term is equal to the num- 
ber of G vectors that are zero. Here 0(k n ) denotes a 
term of order fc^fc'fc^, where n = p + q + r. 

The partial quadratic density l|7.1|) generates a 
Coulomb potential v^ 2 \q) = v c (q}h^ (q), which is then 
screened to produce ip( 2 '(q) of Eq. (|5.9|l . This potential 
is calculated by replacing u(q) with v^(q) in Eqs. 
and l|6.7|) . The result may be written as 



j2) ( , _ tt e (q)" (2) (q) 



^(q) 



e(q) 



(7.5) 



where n^ 2 \q) = n^ 2 ^(q)e(q) is an effective "external" 
density 



^ 2 )(q)=E i? OG(q)n^(q + G), (7.6) 



G 



in which 

■Roc(q) 



1 



if G = 0, 



P 0G (qK(q+G) ifG^O. 



(7.7) 



In Eq. JEBJ, n^(q+G) is given by fTT)) . where ip(k+G) 
can be expressed in terms of <p(k) using Eq. I|6.26|) . The 
resulting expression for (|7.6|) can be written as fS 2 ^ (q) = 

^A^(q) + ^s^(q) + "-c^l)' m wn ich 



-,(2) 



,( 2 



(q) = \J2' A ^ kc L- kMkMq - k), 
k 

(q) = jE'[ B ( ( i' k ' c i- k M c i- k ) 



(7.8) 



B(q,q-k,k)#)], 



1W 



(q) = ~E c(q, k ,q- k )- 



Here the functions A, B, and C, which are defined in 
Appendix El have the Taylor series expansions 

A(k, ki,k 2 ) = kaki^jAapj + 0(k 4 ), 
5(k,ki,k 2 ) ^k a k 2l B a7 + 0{k 3 ), (7.9) 
C(k,k l5 k 2 ) = k a C a + 0(k 2 ). 



10 



For q values inside the first Brillouin zone, the func- 
tions A, B, and C in 1)7. 8fl are analytic for all k values 
included in the summation, but <p(k) is nonanalytic at 
k = 0. Therefore it is possible that the if terms in (|7.8|) 

may produce nonanalytic behavior in n^ 2 ^(q). n^\q) is 

(2) 

obviously analytic in q, as is the second term in n B (q). 

(2) 

The first term in n B (q) is as well, since a small varia- 
tion 5q can be eliminated from ip(q — k) with an equal 
variation i5k = Sq. This slightly shifts the zone boundary 
in the summation, but ip(q — k) is analytic at the zone 
boundary, so fl^(q) is analytic for small q. 

(2) 

However, for n A (q) this argument is no longer valid. 
The singularities in (p(k) and <p(q— k) merge when q = 0, 

f 21 

producing nonanalytic behavior in n A (q) at this point. 

f 21 

The contribution from the nonanalytic part of n A (q) is 
examined in Appendix [UJ where it is shown to be neg- 
ligible under all three approximation schemes defined in 
the Introduction. Therefore, only the analytic part of 
n^(q) is retained here. 

The leading contributions to the quadratic screened 
potential arc therefore 

<pW(q) = w c (q)(q a nW + q a qpn^) + 0(q), (7.10) 

where and are Taylor series coefficients for the 

analytic part of n^ 2 \q). The absence of a constant term 
in the power series for fj/ 2 )(q) is a consequence of the 
Ward identity l|7.3[l . Equation (|7.10(l is used in its full 
form only for isovalent class III perturbations. For class 

- (2) 

II, the n a p term is negligible, while for class I, the entire 
contribution from (q) is negligible^ 

In the vicinity of a nonzero reciprocal lattice vector, 
f 1,2 (q + G) can be written in a form similar to l|6.26[) : 

^ 2 )(q + G) = i? G0 (q)^ (2) (q) +4 2) (q), (7-H) 



in which (q) is given by Eq. I|6.28[l with v(k) replaced 
by ?/ 2 )(k) = v c (k)r7,( 2 ) (k). Using the same type of anal- 
ysis as before, one finds that the nonanalytic part of 
£^(q) is 0{q) for class I, 0(q 3 ) for class II, and 0(q 5 ) 

(2) 

for class III. Therefore, the limit £ G (0) is well defined, 
and the leading terms in tp( 2 ' 1 (q + G) are given by 



^ 2 >(q + G) = w c (q)(q a qpn^R^ ) + £ ] (0) + 0(q). 

(7-12) 

Since the leading terms here are O(q ), this contribution 
is negligible for classes I and II. 

Note that the quadratic response for a heterostructure 
cannot be written as a superposition of spherically sym- 
metric atomic perturbations; one must also include di- 
atomic perturbations with axial symmetry Cqc (for a 
heteronuclear diatomic molecule) or (for a homonu- 
clear diatomic molecule)^ The symmetry of n^ 2 ^(q) is 
determined by the maximal common subgroup of the 
reference crystal space group and the molecular point 
group. For example, for a perturbation at neighboring 
atomic sites in zinc-blende, the symmetry of n^^q) is 

— (2) 

Cs v , which supports a nonvanishing dipole moment rv a . 

In general, is nonvanishing for any heteronuclear 
perturbation, because Coo V itself permits the existence of 
a dipole. Such dipoles therefore always appear in hetero- 
structures involving more than one type of atomic per- 
turbation (e.g., InAs/GaSb). (This property of the non- 
linear response was deduced from numerical calculations 
of band offsets in Ref. l95h Furthermore, the quadrupole 

term n^g — ^x\^ a * s nonvan ishing for any diatomic 
perturbation, since isotropy requires cubic symmetry. 



I 

VIII. SELF-ENERGY FOR A LOCAL PERTURBATION 



A. Linear terms 



The above results may now be used to calculate the self-energy S and the total potential V defined in Eqs. i|3.3|) 
and H5.12fl . The total linear potential (|5.12(l is given for the case of a local perturbation by 

U« (k + G, k' + G'; u>) = f £ (k + G, k' + G'; q + G"; u)<p(q + G"), (8.1) 

G" 

where q = k k'. Here y?(q + G") can be expressed in terms of <p(q) using Eq. (|6.26|) ; this yields 

U S ( S V (k + G, k' + G'; u) = A ss , (k, k'; G, G'; w)<p(q) + W$ (k, k'; G, G'; u), (8.2) 

in which 

Iui s V(k,k';G,G»= ]T rfJ(k+G,k' + G';q + G";a;)^(q) (8.3) 
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is an analytic function of k and k' (and therefore also of q). The nonanalytic terms are all contained in the screened 
potential <p(q) = v^iio) + Vci^nio) / {(q) i which is multiplied by the effective macroscopic vertex function 

A ss , (k, k'; G, G'; w) = £ f $ (k + G, k' + G'; q + G"; w)i? G » (q). (8.4) 

G" 

Since A ss / is an analytic function of k and k', it can be expanded in a Taylor series [treating q = k — k' and 
Q = ~(k + k') as the independent variables], with the result 

A ss , (k, k'; G, G'; u) = [S SS ,S GG , - 911,°) (G, G'; u)/du>] + g«A£ IO (G, G'; u) + Q q A^, q) (G, G'; w) 

+ 9Q ^A^/ 3| - ) (G,G';a;) + Q^A^G, G» + fe Q (3 A<f(G,G» + 0(g 3 ). (8.5) 

Here the lowest-order term is determined by the Ward identity (|3 - 1 1 1) . and the Taylor series coefficients such as A^"}^ 
are given in Appendix [H] The analytic potential (|8.3|1 can be expanded in the same way: 

W$ (k, k'; G, G';w) = W { J (0, 0; G, G'; u) + q a W^ (G, G';u) + Q a W [ s f ] (G, G'; w) 

+ q a q W { s f^(G, G'; u ) + Q a Q W^ 0) (G, G'; w ) + feQ^ 1 " 3 (G, G'; W ) + 0(q 3 ). (8.6) 

In the expansion Q8.5JI . a term such as q a appears in the total potential 18.2|l as a multiplicative factor in front of 
the screened potential y(q). In coordinate space, this term therefore takes the gradient of <p(x), generating the a 
component of the macroscopic electric field produced by the perturbation u(x). Likewise, the term Q a has the form 
of a (symmetrized) crystal momentum operator that acts upon the envelope functions in an effective-mass theory. 
The Taylor series <|8.6[1 for the analytic potential (|8.3|) is interpreted in the same way, except that these terms produce 
only short-range localized potentials because they are analytic functions of q. 

The various terms in Eq. I|8.5|l therefore give rise to various long-range spin-dependent potentials whose particular 
form is determined by the symmetry of the coefficients A,"J \ etc. The specific term that generates the long-range 
Rashba effect is A^j^\ since this term is linear in the electric field <7 a </?(q) and linear in the crystal momentum Qp. 
The usual short-range part of the Rashba coupling is generated by the analogous term wj"'^ in Eq. i|8.6[l. 

The complete expression for is obtained by inserting the expansion l|8.5|l for A and one of the three expansions 
(|6.22[1 . (|tj.23fl . or iltj.24fl for <^(q) into Eq. (|8.2|) . For the specific example of isovalent perturbations in zinc-blende 
materials treated in Eq. I|6.25|) . one finds 

^'(k + G, k' + G'; lo) = L^°{[ Sss , 5gg , _ fl S (°) ( G , G'; u)/du] [^n 2 q 2 /e + C 3 q x q y q z + C 4 (qt + q A y + qf)] 

+ C 3 q x q y q z [q a A { ™ ] ;\G, G';u) + Q a A^(G, G»]} + analytic terms + 0(q 3 ), (8.7) 

where the analytic terms include and contributions from the analytic part of y(q). From this result it can be 

seen that the Rashba effect in isovalent zinc-blende materials does not include any long-range terms (to within the 

accuracy of the present approximation scheme), since A,™ contributes only to 0(q 3 ). However, there are other 
long-range spin-splitting terms of 0(q 2 ) or lower, and the Rashba effect does contribute nonnegligible long-range 
terms for perturbations in classes I and II. 

B. Quadratic terms 

Turning now to the quadratic response, the two contributions to in Eq. I|5.12|l will be denoted T/( 2a ) and V^ 2h \ 
respectively. The first of these is given by 

Vf, a) (k + G,k' + G';a0 = i^'^^ffJ(k + G,k' + G';k 1 + G 1 ,q-k 1 + G 2 ; W )^(k 1 + G 1 )( /3 (q-k 1 + G 2 ). (8.8) 

ki Gj G 2 

Upon inserting Eq. (|6.26l) for (p(k + G) into the right-hand side, one obtains an expression for V {2&) very similar to 
that found in Eqs. (|7.8|l and IjFlfl for the effective quadratic density (q). Just as before, there are both analytic and 
nonanalytic contributions. The nonanalytic contributions can be evaluated using the method outlined in Appendix 
IU1 the results show that the nonanalytic terms in I/( 2a ) are 0(q _1 ) for class I, O(q) for class II, and 0(q 3 ) for class 
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III. (An explicit expression for the leading nonanalytic term in class I was given previously by ShamiSi2&) Therefore, 
the nonanalytic contributions are negligible in all three cases, and the leading term in U^ 2a ) is just a constant: 

(k + G, k' + G'; u) = (G, G'; u) + 0(q). (8.9) 

This term is negligible under the approximation schemes for classes I and II. 
Finally, the second contribution to in Eq. Ij5.12|l is given by 

V# b) (k + G, k' + G'; w) = J2 f fj(k + G, k' + G'; q + G"; w)<pW ( q + G"), (8.10) 

G" 

where tp( 2 \q + G") was given previously in Eq. (|T. lip . Inserting this expression into Eq. H8.10[l . one obtains 

U s ( g 2 , b) (k + G, k' + G'; w ) - A SS ' (k, k'; G, G'; ^)^(q) + lujf >(k, k'; G, G'; u), (8.11) 
in which A was defined in Eq. I|8.4|l . and 

^ b) (k,k';G,G';c)= £ f£)(k + G, k' + G'; q + G"; w)$l (q). (8.12) 

Unlike the case for Eq. I|8.3[l . this is not an analytic function of q. However, as discussed below Eq. (|7.11|l . the 
nonanalytic portion is 0(q) and therefore vanishes at q — 0. An explicit expression for V (2h ^ can now be obtained by 
inserting the expansion i|7.1U|) for tp( 2 \q) into Eq. JSTHJ: 

V s f ] (k + G, k' + G'; w) = q a w c (q){n^ [5 SS ,S GG , - 0E$(G, G'; u)/du + (feA^G, G'; w) + Q P A^\G, G'; u)]} 
+ q a q w c (q){nW[6 ss ,5 GG , - 9SfJ(G, G'; u)/du]} + W^(0, 0; G, G'; u) + 0(q). (8.13) 

The first and second terms are dipole and quadrupole potentials, respectively, while the last term is just a constant. 
In class II, only the leading 0(q^ 1 ) dipole term is retained; in class I, the entire expression l|5.13|l is neglected. 



IX. NONLOCAL PERTURBATIONS 

An arbitrary nonlocal potential can be separated into 
local and nonlocal parts (although this separation is not 
unique) : 

Uss ,(x ) x')=^(x,xO+< 1 (x J x'). (9.1) 

Here the local part v'°/(x, x') has the form of Eq. (|6.1() 
and is treated according to the methods developed above. 
This section considers the changes in the preceding ex- 
pressions that may be necessary in the case of nonlo- 
cal perturbations, particularly those involving spin-orbit 
coupling. 



A. Analytic form 

A general nonlocal potential can be written as 

v ss > (x, x') = S aa >v (x, x') + (T ss > ■ v(x, x'), (9.2) 

where vq is a scalar relativistic potential, er is the Pauli 
matrix, and v is a pseudovector (similar to orbital angu- 
lar momentum) that accounts for spin-orbit coupling. If 
v is hermitian and time-reversal invariant, then v is real 
and symmetric, while v(x, x') = — v(x',x) is imaginary 



and antisymmetric. Thus v can have no local compo- 
nent, and a local time-reversal- invariant potential must 
be a spin scalar: 

i£?(x,x') = 5s*5(x - x'Kc(x). (9.3) 

In the norm-conserving pseudopotential formalism, the 
nonlocal part of the ionic pseudopotential u"], (x, x') 
is confined to a small region near the nucleus, typi- 
cally either having the form of a polynomial times a 
Gaussian^S^i^ 2 - or vanishing absolutely outside a core re- 
gion of radius r c i2L2£ As a result, w"]/(k, k') is an entire 
analytic function of k and k'. 

It is important to note that this analytic form relies 
upon a physical approximation. In an all-electron cal- 
culation where the pseudopotential approximation is not 
used, the spin-orbit coupling does in general include a 
contribution from the long-range Coulomb part of the 
ionic potential. The choice of an analytic pseudopoten- 
tial u"], (k, k') is therefore an approximation, in which the 
spin-orbit coupling is assumed to be dominated by the 
contribution from the ionic core. Conventional norm- 
conserving pseudopotentials incorporate all relativistic 
corrections of order Z 2 a 2 (where Z is the atomic number 
and a is the fine-structure constant), but neglect various 
terms of order a 2 j22ilffi including the spin-orbit coupling 
from the long-range (but slowly varying) Coulomb poten- 
tial outside the core region. This approximation is used 
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in all that follows. It greatly simplifies the analysis of 
spin-dcpcndcnt perturbations, as shown below. 

B. Screening 

Consider now the description of screening for a spin- 
dependent perturbation. The relationship between the 
total polarization II and the regular polarization P was 
given above in Eq. 15.24fl for a general nonlocal potential. 
Setting G = in this equation gives 

(q; k, k + q + G') = e" 1 (q)P as , (q; k, k + q + G'), 

(9-4) 

where e(q) is the same scalar dielectric function defined 
above in Eq. I|f).4|) . Substituting this result into Eq. (|5.24ll 
then yields 

Tl ss> (q+ G; k, k + q+ G') = P ss > (q + G; k, k + q+ G') 




Equations (|9.4|) and l|9.5|l replace the scalar equations 
(|6.5|) derived previously. 

If the perturbation is now separated into local and non- 
local parts, the linear screened potential H5.10(l can be 
written in a form similar to i|tj.l9fl : 

, n (\r v'\ „™ Or v\ , x «c(q)[^(q) +»ni(q)] 
Vss< (k, k ) = v ss , (k, k ) + d ss , — , 

e (q) 

(9.6) 

in which «£(k, k') = vf s ,(k, k') + <W« an (q), q = k k', 
n(q) is the effective density ()6.2(J[) for the local potential, 
and 

rcm(q) = J2 E ^'A(q; k "> k " + q + G ) 

k" G 

x< A ,(k"+q + G,k") (9.7) 

is a correction to n(q) from the nonlocal part of the per- 
turbation. Since n n i(q) = 0(q) is an analytic function 
with the same site symmetry as n(q), this term does not 
produce any qualitative changes in ip; it merely renor- 
malizes n(q). The only qualitatively new contribution to 
ip is the spin-dependent term v^ l s , (k, k') itself, which is 
analytic. 

For wave vectors in the vicinity of a nonzero reciprocal 
lattice vector, it is convenient to write Eq. (|5.10() in the 
following alternative form: 

p ss ,(k + G,k') =i? G0 (qW(k,k')+^(k,k'). (9.8) 

Here i?Go(q) was defined in Eq. (|6.27[) . and 

C(k,k') = M£G(q) + (l-<5GoH(q+G)7i nl (q+G)] 
+ vf s> (k + G, k') - R G0 (ci)vf s , (k, k') (9.9) 

is a generalization of the function £c(q) defined in Eq. 
(|6.28|) . This is an analytic function of k and k' for q < 



If Eqs. I|9.6|l and (|9.8|) are inserted into the nonlocal 
version of Eq. I|8.1[) [i.e., Eq. (|5.12l) ]. it is apparent that 
the nonlocal part of the perturbation produces no quali- 
tative change in the total linear potential The only 
change is a simple renormalization of the analytic and 
nonanalytic terms in 

The same conclusion also holds for the quadratic po- 
tential . Thus, the correct qualitative form of V^ 1 ) 
and can be derived by ignoring the nonlocal (and 
spin-dependent) part of the perturbing potential, and in- 
cluding spin only in the vertex function A and the an- 
alytic parts of and . This is precisely the ap- 
proach used in Sees. El EEQ and lvTlll 

The key to obtaining this simple result is the fact that 
u"*/ (k, k') is analytic. As shown above, this relies upon 
the approximation of neglecting spin-orbit coupling out- 
side the atomic cores. Such an approximation would 
also be possible (and even desirable for its simplicity) 
in an all-electron calculation where the core electrons are 
treated explicitly. 



X. SUMMARY AND CONCLUSIONS 

This paper has presented an analysis of the self-energy 
of an electron in a lattice-matched semiconductor het- 
erostructure for small values of the crystal momentum. 
A general theory of nonlinear response for nonlocal spin- 
dependent perturbations was developed in terms of ver- 
tex functions and the static polarization, and applied to 
the case of quadratic response in a periodic insulator at 
zero temperature. A set of Ward identities was estab- 
lished for nonlocal spin-dependent potentials. The het- 
erostructure perturbation was separated into a local spin- 
independent part and a nonlocal spin-dependent part, 
and the contributions from these were analyzed sepa- 
rately. Due to the neglect of spin-orbit coupling outside 
the atomic cores, the nonlocal part of the potential is 
analytic in momentum space. As a result, the nonlocal 
part of the perturbation merely renormalizes the contri- 
butions from the local part. 

The main results of the paper are presented in Eqs. 
lHHl, and iPTTTjl . The total linear potential l|8~S)l 

has the form = Kp+W^ x \ in which all of the nonan- 
alytic contributions come from the screened scalar poten- 
tial ip. This has a form ip(q) = i> an (q) + w c(q)«-(q)A(q) 
similar to that for screening in a homogeneous system, 
except that the effective density n(q) has the site sym- 
metry of the perturbation and the macroscopic dielectric 
function e(q) has the symmetry of the reference crystal. 

Spin-dependent contributions come from the analytic 
part and the vertex function A. The vertex func- 

tion can be expanded in a Taylor series (|8.5|) , in which q a 
takes the gradient (in coordinate space) of y(q), while Q a 
is a crystal momentum operator in effective-mass theory. 
The generalized Rashba effect comes from the term linear 
in q a and Qp, but there are other spin-splitting contribu- 
tions from the lower-order terms as well. A more detailed 
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analysis of the various terms is given in the following pa- 
per on effective-mass theory— 

The total quadratic potential of Eqs. (|8.9() and (|8.11() 
has a similar form — Aip^ + W^ 2 \ in which 
is analytic to within the accuracy of the approxima- 
tion scheme defined in the Introduction. The quadratic 
screened potential is (p^ 2 - 1 (q) = v c (q)n^ (q) / e(q) , where 
the effective external density (q) has the site symme- 
try of a diatomic perturbation in the reference crystal. 
Due to the Ward identities for an insulator, the leading 
term in the power series expansion of n' 2 ' (q) is a dipole 
term. The Rashba term in the quadratic potential is 
always negligible under the approximation scheme used 
here. 

The results derived in this paper are used to develop 
a first-principles effective-mass theory in the following 
paper The present results are of crucial importance in 
establishing clearly defined limitations on the validity of 
this theory. Most previous formulations of effective-mass 
theory have been based on non-self-consistent empirical 
pseudopotentials, for which the possibility of long-range 
Coulomb interactions is not even considered. However, 
as shown here, long-range potentials arising from non- 
analytic terms in the screening potential — and even the 
charge density itself — must be considered in general. 

The omission of such terms is partially justified (to a 
certain order of approximation) in isovalent zinc-blende 
systems, where high crystal symmetry eliminates the con- 
tributions from dipole and quadrupole terms in the linear 
response. 51 However, it is not fully justified even in zinc- 
blende, since the leading octopole terms are of a lower 
order than the position dependence of the effective mass, 
which is often included in heterostructure effective-mass 
calculations. The following paper— accounts for all terms 
of the same order as the position dependence of the ef- 
fective mass, including the octopole and hexadecapole 
potentials derived here in Eqs. l|tj.25fl and 1)8.7(1 . 

A pioneering paper by Sham on effective-mass theory 
for shallow impurity states^ dealt with many of the same 
issues (for local spin-independent potentials), but at a 
lower order of approximation. In particular, Sham con- 
sidered only the lowest-order terms in cubic crystals. At 
this level of approximation, the total polarization can be 
treated as analytic [see Eq. (4.8) of Ref. whereas 
the present Eq. (|6.5fl shows that this is no longer true 
for terms of higher order (such as those needed for the 
analysis in Ref. f59l) or crystals of lower symmetry. The 
present work provides a systematic framework for extend- 
ing Sham's analysis to crystals of general symmetry and 
terms of arbitrary order. 

The value of establishing such a framework is demon- 
strated by the result (|7.1U|) derived here for the leading 
dipole term in the quadratic density response. Sham has 
stated that the quadratic density response contains no 
dipole termsjiSi but the justification for this statement 
is not clear because no details of his calculation were 
given. However, numerical evidence to the contrary was 
subsequently provided by Dandrea, Duke, and Zunger 



in a first-principles study of band offsets in InAs/GaSb 
superlattices— They deduced that the calculated differ- 
ence between the macroscopic interface dipoles for InSb 
and GaAs interfaces must be a nonlinear effect (because 
such differences do not exist in linear response theory^i 
in cubic crystals), but did not inquire further into its 
origin. 

To the author's knowledge, the present derivation pro- 
vides the first direct explanation for their result, and 
the first demonstration that dipole terms are a gen- 
eral feature of the quadratic density response. The 
magnitude of such dipoles is small — contributing 50 to 
100 meV to the band offset of typical no-common-atom 
heterojunctions^iS^ — but they play an important role 
in explaining the experimentally observed asymmetry of 
band offsets in such systems— £ 

As a final note, it is worth drawing attention to a fun- 
damental property of the nonlinear response of insula- 
tors that apparently is not widely known. For exam- 
ple, in Refs. l5ll and Baroni et al. have pointed out 
that "within linear response theory, the electronic charge 
induced by a given perturbation is proportional to the 
charge of the perturbing potential,"— which implies that 
"within linear response theory, isovalent substitutional 
impurities carry no net charge."— Although the restric- 
tion to linear response theory is necessary in general, the 
results derived here (in Sec. IVIIjl demonstrate that the 
quadratic density response of an insulator to a charged 
perturbation also carries no net charge (i.e., it vanishes in 
the limit of small wave vectors). Indeed, upon replacing 
Eqs. H4.10fl and (|5.19|) with their higher-order generaliza- 
tions, one finds that this statement remains true for the 
nonlinear density response (|4.tj[l of arbitrary order. 

This result stems from the Ward identities l|4.10|l for 
the total static polarization and proper polarization (see 
Sec. IV CJ) of an insulator at zero temperature. As a con- 
sequence of these identities, one can therefore state that 
in an insulator, the total electronic charge induced by 
a given perturbation is exactly linearly proportional to 
the charge of the perturbing potential. Of course, this 
statement assumes that the system remains insulating 
over the full range of the perturbation (from zero to full 
strength); otherwise, the perturbation theory used here 
is no longer valid. 
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APPENDIX A: REDUCING HETEROVALENT 
PERTURBATIONS TO ISOVALENT 
PERTURBATIONS 

In a heterovalent system such as Ge/GaAsfSi the ionic 
perturbations are from class I. However, it is often pos- 
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sible to reduce such problems to an equivalent class II 
or III problem, because accumulations of macroscopic 
charge are energetically unfavorable; therefore, the in- 
terfaces grown in real heterojunctions tend to be macro- 
scopically neutral. 

For Ge/GaAs, an ideal (110) heterojunction is already 
neutral, but an ideal (001) or (111) interface would have 
a large macroscopic interface charge, leading to a large 
compensating interface free-carrier density^ Since this 
is not observed experimentally, the atoms in a real inter- 
face are believed to be arranged in one or more layers of 
mixed composition, such that the net macroscopic inter- 
face charge is zeiom^^ (This is similar to the concept of 
surface reconstruction, but the interface layers differ from 
the bulk only in chemical composition, not structure.) 
In such a system it is possible to replace the heterova- 
lent ionic perturbations with a set of equivalent isovalent 
perturbations, simply by grouping the atoms together in 
clusters. 

The first step is to define quasiatomic building blocks 
using a modified version of Evjen's technique>i2£ Let 
fi (r) be a Wigner-Seitz cell (of the reference crystal) 
that is centered on position r, and let Af be the number 
of atoms in any primitive cell of the reference crystal. For 
a given atom a at position r a in the heterostructure, the 
quasiatomic potential v a is defined in terms of the ionic 
potentials v a > for all atoms a' via 

v a = 22w a >(a)v a >. (Al) 

a' 

Here w a >(a) is a weight factor, defined as w a <(a) = 1/N 
if atom a' lies inside f2 (r a ), w a <{a) = if a' lies outside 
fio( r a), and w a i{a) — 1/mM if a' lies on the surface of 
^o( r a) [where m is the number of cells flo(r a + R) that 
share atom a', with R any Bravais lattice vector of the 
reference crystal]. In a bulk crystal, these quasiatoms 
are neutral objects with the site symmetry of atom a in 
the reference crystal. Therefore, in a heterostructure, the 
quasiatoms carry a charge only near the heterojunctions. 

For Ge/GaAs, each quasiatomic building block is con- 
structed from i of the potential for a given ion plus | 
of the potential for each of its four nearest neighbors. 
In a bulk zinc-blende crystal, these quasiatoms have Td 
symmetry and possess no charge, no dipole moment, and 
no quadrupole moment. Therefore, in a heterostructure, 
the quasiatoms have monopole, dipole, and quadrupole 
moments only near the heterojunctions. 

It is assumed here that the ions in the mixed- 
composition interface layers form a periodic array, so 
that a two-dimensional superlattice translation symme- 
try exists in the directions parallel to the junction plane. 
In this case, one can define a three-dimensional "slab- 
adapted" 104-105 unit cell of quasiatoms that is large 
enough to contain 100% of the ions in the mixed- 
composition layers. This unit cell has no net charge (since 
the interface is assumed to be macroscopically neutral), 
and for some choices of compositional mixing, it may also 
have no dipole moment^ - 



Thus, if one treats these slab-adapted unit cells as 
the fundamental perturbations, this type of heterovalent 
class I perturbation can be replaced by an equivalent 
neutral perturbation from class II or class III. In gen- 
eral the interface cells do have a dipole moment, so the 
interface perturbations are class II, while the bulk pertur- 
bations are class III. However, an interface dipole term 
of 0(q -1 ) is physically equivalent to a bulk quadrupole 
term of 0(q°). Therefore, the approximation scheme de- 
fined in the Introduction yields results of the same overall 
accuracy for both the class II interface and class III bulk 
perturbations in Ge/GaAs. 

APPENDIX B: SYMMETRY PROPERTIES 

This appendix considers some symmetry properties of 
G and E. Time-reversal symmetry is developed from the 
properties of zero- and one-particle states. The vacuum 
state |0) is defined to be time- reversal invariant: 

e|o) = |o), (bi) 

where O is the antiunitary time-reversal operator. The 
phase of G) may be partially defined by letting the single- 
particle basis states |x, s) — i/4(x)|0) satisfy 

9|x, S ) = (-l) s - 1 / 2 |x,- S ). (B2) 

This relation is consistent with the operator equation 106 

e#(x)0t = (-lj-^l^x). (B3) 

One may therefore define over the entire many-particle 
Fock space by Eqs. (|Blll and (|B3f) . 
The next step is to use the identityi2£ 

(f3\A\a) = 0\eA&\a)* = {a\dA*&\ft, (B4) 

in which A is a linear operator and \a) = Q\a). If the 
many-particle Hamiltonian H is time-reversal invariant 
(i.e., [0,-ff] = 0), one has 

9[4(x, r)]t6t = (-l)-Va^t a(Xi _ r)j (B5 ) 

which holds for complex r. From this and Eq. i|B4|) one 
immediately obtains 

G^(x,r;x',r') = (-l) s - s 'G_ s ,._ s (x', -r'; x, -r), 

(B6) 

which is the generalization of an ordinary Green-function 
"reciprocity" relation 1 ^ to the interacting-particle case. 
(A similar expression was given in Ref. l79l but with the 
sign term omitted.) Since the change of time variables in 
(IB6|) does not alter t — t', the Fourier transform of l|B6|) 
is just 

G 5S ,(x,x» = (-l) s - s 'G_ s ,,_ s (x',x, W ). (B7) 
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Now H is time-reversal invariant if and only if h is, which 
from (|B2[1 and (|B4() implies that 

/W(x,x') = (-l) s - s V s ,,_ s (x',x). (B8) 

Equation (|2.6(l then shows that E has the same time- 
reversal properties as G: 

£ ss ,(x,x» = (-l) s - s '£_^_ s (x',x,^). (B9) 

G and E may also satisfy other conditions derived 
from linear symmetries of H. Consider the linear many- 
particle operator Q defined by 

Q = J2 I! t t (x)9 ss -(x,x>,(x')ArfV, (BIO) 



in which q is a linear single-particle operator. Q obeys 
the commutation relations 

[Mx),Q]=J2 y"fe'(x,x')?Mx')^', (Blla) 

s' 

[#(x),£] = -E/ ^(x')gy s (x',x)dV. (Bllb) 

The commutator of any two such operators is another 
operator with the same form: 



[<3i,<?2] = Q3; Q3 = [<?1,92] 



(B12) 



Now suppose that the Hamiltonian has the symmetry 
[K,Q] = [H,Q] = 0. From (|El2ji . this is pos sible only 
if [ft, q] = 0. One can then use Eqs. (|2.1J) and (|B11|> and 
the cyclic property of the trace to show that 



[G,q]=0, 



(B13) 



which further implies that [E, q] = 0. 

This result is applied to lattice translations throughout 
this paper, and to other space group operations in the 
following paper, 59 Also of interest in the present paper is 
the spin operator s SS '(x, x') = i<x ss /<5(x — x'), where er is 
the Pauli spin matrix vector. If ft is independent of spin 
(i.e., [ft, s] = 0), then [G,s] — 0, and G and E have the 
scalar form 



E as /(x,x',w) = 5 SS /E(x,x',w). 



(B14) 



However, if ft includes spin-orbit coupling (which is the 
case studied here), E is nondiagonal. 



APPENDIX C: FOURIER TRANSFORMS 



Since V(t,t') = V(t — r'), the latter integral is always 
diagonal in n: 



(C3) 



For many-variable quantities such as the vertex function 
the Fourier integrals for (k, k'; q, q'; . . .) have 
the same form as (|C1|1 for each pair of (k, k') variables. 

For a function of the form /(x) = /(r)YJ m (x), where 
y ; m is a spherical harmonic, the Fourier transform is of 
the form /(k) = /(fc)F/"(k), where for k > 0, 

a r°° 
f(k) = -^H) 1 J r 2 /Wj/(fcr)dr, (C4) 

in which ji(kr) is a spherical Bessel functio n. Fo r the 
special case f(r) — r~ n , Eq. (6.561.14) of Ref. 1 1091 gives 



/(fc) " fi ( %) 2«-ir[i(/ + n)] * ' 



(C5) 



in which T(z) — J °° e l t z 1 dt. Equation ljC5(l is valid 
for k > 0, n > 1, and I > n - 3^ 



APPENDIX D: PERTURBATION THEORY 

The starting point for the perturbation theory used in 
Sec. HVl is the standard formula&Sl 



(Mr)) 



(T T [i / (r)^]) 



(Dl) 



where Ah{t) is a Heisenberg picture operator, Ai{t) is 
the same operator in the interaction picture, (O) denotes 
the thermal average with respect to K, (O) is a 

thermal average with respect to Kq = Hq — fiN, and 
U = T T {cxp[— Hi(t)cIt]}. IfU is expanded in a power 
series, terms of equal order in the numerator and denom- 
inator can be grouped together as 



00 (—\\ u r@ 
(A h (t)) = (A i (t))o + J2 [ -^ L / dn 



d,T„ 



x (r T [A#i(n) • ■■AH 1 (t„)AA i (t)}) , (D2) 

where AA^h{t) = Ai,h{t) — (Ai(t))q. A more compact 
expression for l|D2|l is 



(AA h (t)) = (T T [A^(r)W]) , (D3) 
where W = T r {exp[- ^ AH^dr]}. 



The Fourier transforms of the potential with respect 
to momentum and frequency are defined by 



1 



V(k,k') = - I / e- ,kx y(x,x')e lk ' x 'AA', (CI) 
" Jn Jn 

^(Cn,Cn') = 4 f f e^V(T,T')e-<^'dTdr'. 
P Jo Jo 

(C2) 



APPENDIX E: POLARIZATION 



The static polarization l|4.6|l is defined by 



n M (00',ll',...,iV) 



rP rP 

/ dn--- dr u D^(00',ll',...,i/iy'), (El) 
Jo Jo 
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where D is the dynamic polarization 

£>M(00',11',...,m/) 

= (-iy{T T [Ap(00')Ap(W) ■ ■ ■ ApH])o. (E2) 

Here a superfluous second time variable has been added 
(for notational convenience) to the interaction picture op- 
erators according to the definition 

p ss , (x, r; x', t') = p ss * (x, x', r - r'), (E3) 

where r' = 0. Since D^"' is periodic (with period 0) in 
all time variables t\, but depends on time only via the 
intervals t\ — tq (for A = 1, 2, . . . , u), it follows that II^) 
is independent of time. 

By definition, D is symmetric with respect to inter- 
change of any pair of operators A/5; thus 

n v (•••,«' «',...) 

» ! ;••••././' <>■■■■■■ (E4) 



The functions A, B, and 



A(k,ki,k 2 ) 

S(k,k!,k 2 ) 

C(k,ki,k 2 ) 



APPENDIX G: NONANALYTIC TERMS IN n^°(q) 

To leading order, the term n^\q) in Eq. Ij7.8j) is 

nf(q) = 2 Aa ^ qa 12 k P (*r _ M v(k)^(q-k) + 0(g 3 ), 

(Gl) 

in which A Q ^ 7 is the Taylor series coefficient (|7.9|l . The 
only contribution to Eq. i|Gl|) that is of order q 2 comes 
from the monopole terms in <^(q). For any perturbation 
comprising a finite number of atoms, the expansion (|6.21(l 
begins as ip(q) — ~Z v w c (q)/il + 0(q~ 1 ), where Z v is the 
net valence charge of the ionic perturbation. 

If this lowest-order term is considered, the value of Eq. 
(|G1|I can be estimated by extending the summation to 
all values of k. For a cubic crystal with scalar e, the con- 



Another constraint on D can be derived from time- 
reversal symmetry using the methods of Appendix [5] 



£> (v) (00', 11', . . . , w') = (-1) S L» M (0'0, T'T, . . . , v'v). 

(E5) 

Here (A) = (xa,-sa,ta) and 



S = 5>a- S 'a). (E6) 

A=0 



These symmetries are valid for the static polarization n 
as well (with the time variables omitted). 



(El) 



I 

volution can be performed using the Fourier transforms 
in Appendix [HI the result is 

_(2)/ \ 71-2 , qaipq-f . ^, 3 x fri r,\ 

n>A (q) = gj^J Aa ^ — q + °( q >- ( G2 ) 
This shows explicitly that the leading nonanalytic term 
in f#>(q) is 0(q 2 ). 

However, this term exists only for the heterovalent per- 
turbations of class I in the Introduction. For the isova- 
lent perturbations (Z v = 0) of classes II and III, there 
is no 0(q 2 ) term in n^\q). A similar analysis shows 
that for class II, n^\q) = 0(q A ), while for class III, 

n^?(<l) — 0(q 6 ). Therefore, according to the approxima- 
te) 

tion scheme adopted in this paper, n A (q) is negligible 
in all three cases. 



I 

APPENDIX F: FUNCTIONS A, B, AND C 

C introduced in Eq. (|7.8|) are defined by 



= E E E ^0G(k)n( 2 ) (k + G, ki + G 1; k 2 + G 2 ) J R Gl o(k 1 )i? G2 o(k 2 ), 

G Gi G2 

= E E E fl o G (k)ft^ (k + G, ki + Gi, k 2 + G 2 )£ Gl (k 1 )i? G2 o(k 2 ), 

G Gi G 2 

= E E E ^o G (k)n( 2 ) (k + GM + G1M + G 2 )e Gl (ki)fe (k a ). 

G Gi G2 



18 



APPENDIX H: VERTEX FUNCTION TAYLOR SERIES 



Since the proper vertex function f ^ is an analytic function of k and k', it can be expanded in a Taylor series in 
the variables q — k k' and Q = |(k + k'). This yields an expression similar to Eq. 1)8.5(1 : 

f « (k + G, k' + G'; q + G"; w) - f « (G, G'; G"; w) + <z«f ^(G, G'; G"; w ) + Q a f ^(G, G'; G"; w) 

+ g«g^f ^ l0 (G, G'; G"; a;) + Q a Qpt^\G, G'; G"; a,) + q a Q f £J« (G, G'; G"; to) + 0(q 3 ). (HI) 

The Taylor series coefficients in Eq. (|8.5|) for the effective vertex function A are therefore given by 



fftU(G,G';0;u;) + ]T f£> (G, G'; G"; c)i? G „ , 



A^(G,G»=f^(G,G';0;a;) 1 
A^'->(G, G'; W ) = f £f »(G, G'; 0; W ) + £ [f^G, G'; G"; u,)i&, + f £(G, G'; G"; W )J2$ ] 



(H2) 



Al-i, Q «(G,G';c)=r™(G I G';0;c) I 
A(f(G,G'; W )=fif(G,G';0; W )+ £ f ^(G, G'; G"; U )R&, , 

G"/0 

in which R^ and -Rqq are the Taylor series coefficients for i?co(q)- In these expressions, some special values of the 
coefficients for the case G" = are given by the Ward identity ((3.11(1 : 

9si°J(G,G';c) 



f^(G,G / ;0;o;)=^*GG'- 



dw 



d 0E$(k + G,k + G';w) 



dk a 



dw 



(H3) 



k=0 



f(-' a/3) (G,G';0;a;) 



1 d 2 ^°!(k + G,k + G» 

2 dk a dkp duj 



k=0 
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